1 |
Word Sense Induction with Attentive Context Clustering
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03586559 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Word Sense Induction with Attentive Context Clustering
|
|
|
|
In: https://hal.inria.fr/hal-03586559 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Word Sense Induction with Attentive Context Clustering
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03586559 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Multi-sense Embeddings through a Word Sense Disambiguation Process
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Crowdsourcing lexical semantic judgements from bilingual dictionary users
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Uso de representaciones vectoriales de las palabras para la detección de dobles sentidos (puns)
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Unsupervised all-words sense distribution learning
|
|
|
|
Abstract:
© 2016 Andrew Bennett ; There has recently been significant interest in unsupervised methods for learning word sense distributions, or most frequent sense information, in particular for applications where sense distinctions are needed. In addition to their direct application to word sense disambiguation (WSD), particularly where domain adaptation is required, these methods have successfully been applied to diverse problems such as novel sense detection or lexical simplification. Furthermore, they could be used to supplement or replace existing sources of sense frequencies, such as SemCor, which have many significant flaws. However, a major gap in the past work on sense distribution learning is that it has never been optimised for large-scale application to the entire vocabularies of a languages, as would be required to replace sense frequency resources such as SemCor. In this thesis, we develop an unsupervised method for all-words sense distribution learning, which is suitable for language-wide application. We first optimise and extend HDP-WSI, an existing state-of-the-art sense distribution learning method based on HDP topic modelling. This is mostly achieved by replacing HDP with the more efficient HCA topic modelling algorithm in order to create HCA-WSI, which is over an order of magnitude faster than HDP-WSI and more robust. We then apply HCA-WSI across the vocabularies of several languages to create LexSemTm, which is a multilingual sense frequency resource of unprecedented size. Of note, LexSemTm contains sense frequencies for approximately 88% of polysemous lemmas in Princeton WordNet, compared to only 39% for SemCor, and the quality of data in each is shown to be roughly equivalent. Finally, we extend our sense distribution learning methodology to multiword expressions (MWEs), which to the best of our knowledge is a novel task (as is applying any kind of general-purpose WSD methods to MWEs). We demonstrate that sense distribution learning for MWEs is comparable to that for simplex lemmas in all important respects, and we expand LexSemTm with MWE sense frequency data.
|
|
Keyword:
natural language processing; NLP; semantics; topic modelling; word sense disambiguation; word sense induction; WSD; WSI
|
|
URL: http://hdl.handle.net/11343/148422
|
|
BASE
|
|
Hide details
|
|
12 |
Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources
|
|
|
|
In: Cognitive Studies | Études cognitives; No 15 (2015); 269-292 ; 2392-2397 (2015)
|
|
BASE
|
|
Show details
|
|
13 |
Translation of keywords between English and Swedish ; Översättning av nyckelord mellan engelska och svenska
|
|
|
|
BASE
|
|
Show details
|
|
14 |
ABSTRACT Long Tail in Weighted Lexical Networks
|
|
|
|
In: http://hal.inria.fr/docs/00/81/62/36/PDF/COGALEX2012-ML-v4.pdf (2013)
|
|
BASE
|
|
Show details
|
|
15 |
Désambiguisation de sens par modèles de contextes et son application à la Recherche d’Information
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Désambiguisation de sens par modèles de contextes et son application à la Recherche d’Information
|
|
|
|
BASE
|
|
Show details
|
|
17 |
SemEval-2010 Task 2: Cross-Lingual Lexical Substitution
|
|
|
|
In: Association for Computational Linguistics (ACL) Workshop on Semantic Evaluations (SemEval), 2010, Uppsala, Sweden (2010)
|
|
BASE
|
|
Show details
|
|
18 |
Word sense disambiguation: a survey
|
|
|
|
In: http://www.dsi.uniroma1.it/~navigli/pubs/ACM_Survey_2009_Navigli.pdf (2009)
|
|
BASE
|
|
Show details
|
|
19 |
Regular Polysemy in WordNet
|
|
|
|
In: ISSN: 0175-1336 ; Journal for language technology and computational linguistics ; https://hal.archives-ouvertes.fr/hal-00611244 ; Journal for language technology and computational linguistics, GSCL (Gesellschaft für Sprachtechnologie und Computerlinguistik) 2009, 24 (2), pp.5-18 (2009)
|
|
BASE
|
|
Show details
|
|
20 |
The Treatment of Word Sense Inventories in the ‘LACELL WSD Project’
|
|
|
|
In: International Journal of English Studies; Vol. 9 No. 3 (2009): Special Issue; 21-38 ; International Journal of English Studies; Vol. 9 Núm. 3 (2009): Special Issue; 21-38 ; 1989-6131 ; 1578-7044 (2009)
|
|
BASE
|
|
Show details
|
|
|
|